Data models for an integrated thesaurus database
Abstract
This paper presents two data models for storing multiple thesauri in a single integrated database to be used as an aid to searchers in multi-database searching, for the construction of conversion tables between thesauri, and as a tool for constructing and maintaining individual thesauri. The paper first describes the nature of thesaurus data and a relational data structure for such data, which is flexible and, through its use of term numbers in recording relationships, economical in storage. It then describes two data models for structuring an integrated thesaurus database. In both models, general data on terms and relationships are stored once, with indication of one or more sources, resulting in storage economy. The term-based model stores all relationships as relationships between terms. This is flexible but redundant: if the same concept relationship is expressed through different terms in different thesauri, it is stored multiple times in the integrated database. The concept-based model identifies concepts by concept numbers and uses these concept numbers to record concept relationships, thus bringing together all occurrences of the same concept relationship regardless of the terms used to express the related concepts. This results in more compact storage but is less flexible.
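The contrast between the two models can be illustrated with a minimal Python sketch. It is not the paper's actual schema: the sample thesauri ("A", "B", "C"), the terms, the synonym groups, and all function names are hypothetical and chosen only for illustration. Terms are stored once under a term number with their sources, and relationships are recorded first between term numbers (term-based) and then between concept numbers (concept-based).

```python
from collections import defaultdict

# Hypothetical sample data: (thesaurus, term, relationship, term).
# "BT" = broader term. Thesaurus B names the same two concepts differently.
SAMPLE = [
    ("A", "Automobiles", "BT", "Vehicles"),
    ("B", "Cars", "BT", "Motor vehicles"),
    ("C", "Automobiles", "BT", "Vehicles"),
]

# Assumed equivalences: each group of terms expresses one concept.
SYNONYM_GROUPS = [{"Automobiles", "Cars"}, {"Vehicles", "Motor vehicles"}]


def build_term_table(rows):
    """Store each term once under a term number, with its source thesauri."""
    term_no = {}                # term string -> term number
    sources = defaultdict(set)  # term number -> source thesauri
    for thesaurus, t1, _rel, t2 in rows:
        for term in (t1, t2):
            n = term_no.setdefault(term, len(term_no) + 1)
            sources[n].add(thesaurus)
    return term_no, dict(sources)


def term_based(rows, term_no):
    """Record relationships between term numbers, each stored once with sources."""
    rels = defaultdict(set)
    for thesaurus, t1, rel, t2 in rows:
        rels[(term_no[t1], rel, term_no[t2])].add(thesaurus)
    return dict(rels)


def concept_based(rows, term_no, groups):
    """Map term numbers to concept numbers and record concept relationships."""
    concept_of = {}             # term number -> concept number
    for concept_no, group in enumerate(groups, start=1):
        for term in group:
            concept_of[term_no[term]] = concept_no
    rels = defaultdict(set)
    for thesaurus, t1, rel, t2 in rows:
        key = (concept_of[term_no[t1]], rel, concept_of[term_no[t2]])
        rels[key].add(thesaurus)
    return concept_of, dict(rels)


term_no, term_sources = build_term_table(SAMPLE)
term_rels = term_based(SAMPLE, term_no)
concept_of, concept_rels = concept_based(SAMPLE, term_no, SYNONYM_GROUPS)

print(len(term_rels), "term-based relationship rows")
print(len(concept_rels), "concept-based relationship rows")
```

With this sample, the relationship shared by thesauri A and C is stored once in both models, but the term-based model keeps a second row for thesaurus B's differently worded version, while the concept-based model merges all three occurrences into one row. The sketch also hints at the flexibility cost: the concept-based model needs the term-to-concept assignments settled before any relationship can be recorded.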
Similar resources
A comparative study of the semantic relationships, formal structure, and management system of the technical-engineering and Nama thesauri
Purpose: Thesauri, as important tools in information storage and retrieval systems, play a significant role in optimizing database searches, so thesaurus publishing should follow standards as closely as possible. I examined and compared two important thesauri on the basis of ANSI/NISO Z39.19-2005. Methodology: This study is an analytical and applied survey. The study population was t...
The state of information retrieval in the Namayeh and Nama databases and an assessment of the effectiveness of using controlled vocabulary in indexing these two databases
Purpose: This study was carried out to determine the level of precision, recall, and searching time for the “Nama” and “Namayeh” databases, and to find out which of the indexing tools (thesaurus and Dewey Decimal Classification) contributes more to improving information retrieval. Methodology: This study is an analytical survey in which the necessary data was collected by direct observati...
Creating and Querying an Integrated Ontology for Molecular and Phenotypic Cereals Data
In this paper we describe the development of an ontology of molecular and phenotypic cereals data, realized by integrating existing public web databases with the database developed by the research group of the CEREALAB project. This integration is obtained using the MOMIS system (Mediator envirOnment for Multiple Information Sources), a mediator based data integration system developed by the Da...
Thesaurus-Based Software Environments
Software environments support the process of constructing and maintaining application systems. This paper describes the idea of a thesaurus as a viable foundation for software environments. A thesaurus contains information about the names and identifiers in all the software written in all the languages of an application. Information about extensional data in a database or persistent store is a...
Prediction of global sea cucumber capture production based on the exponential smoothing and ARIMA models
Sea cucumber catch has followed “boom-and-bust” patterns over the period of 60 years from 1950-2010, and sea cucumber fisheries have had important ecological, economic and societal roles. However, sea cucumber fisheries have not been explored systematically, especially in terms of catch change trends. Sea cucumbers are relatively sedentary species. An attempt was made to explore whether the tim...